Phase Transition in a Noise Reduction Model: Shrinking or Percolation? 
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A model of noise reduction (NR) for signal processing is introduced. Each noise source puts a 
symmetric constraint on the space of the signal vector within a tolerable overlap. When the number 
of noise sources increases, sequences of transitions take place, causing the solution space to vanish. 
We found that the transition from an extended solution space to a shrunk space is retarded because 
of the symmetry of the constraints, in contrast to the analogous problem of pattern storage. For low 
tolerance, the solution space vanishes by volume reduction, whereas for high tolerance, the vanishing 
becomes more and more like percolation. The model is studied in the replica symmetric, first step 
and full replica symmetry breaking schemes. 
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In the past few years, the statistical mechanics of dis- 
ordered systems has been frequently applied to under- 
stand the macroscopic behavior of many technologically 
useful problems, such as optimization (e.g. graph par- 
titioning and traveling salesman) Q, learning in neu- 
ral networks ||, error correcting codes (|] and the K- 
satisfiability problem One important phenomenon 
studied by this approach is the phase transitions in such 
systems, e.g. the glassy transition in optimization when 
the noise temperature of the simulated annealing pro- 
cess is reduced, the storage capacity in neural networks 
and the entropic transition in the i^-satisfiability prob- 
lem. Understanding these transitions are relevant to the 
design and algorithmic issues in their applications. In 
turn, since their behavior may be distinct from conven- 
tional disordered systems, the perspectives of statistical 
mechanics are widened. 

In this paper we consider the phase transitions in noise 
reduction (NR) techniques in signal processing. They 
have been used in a number of applications such as adap- 
tive noise cancelation, echo cancelation, adaptive beam- 
forming and more recently, blind separation of signals 
H|). While the formulation of the problem depends on 
the context, the following model is typical of the gen- 
eral problem. There are N detectors picking up signals 
mixed with noises from p noise sources. The input from 
detector j is Xj — ajS+J^ft £j n ni where S is the signal, 
for [j, = l,..,p is the noise from the /ith noise source, 
and n M <C S. aj and ^ are the contributions of the sig- 
nal and the /ith noise source to detector j. NR involves 
finding a linear combination of the inputs so that the 
noises are minimized while the signal is kept detectable. 
Thus, we search for an N dimensional vector Jj such that 
the quantities £jJj are minimized, while . ajJj re- 
mains a nonzero constant. To consider solutions with 
comparable power, we add the constraint Y^,j Jj = N. 
While there exist adaptive algorithms for this objective 
here we are interested in whether the noise can be 
intrinsically kept below a tolerance level after the steady 



state is reached, provided that a converging algorithm is 
available. 

When both p and N are large, we use a formulation 
with normalized parameters. Let be the local fields 
for the /ith source defined by = Y^j £jJj/v~N. Learn- 
ing involves finding a vector Jj such that the following 
conditions are fulfilled, (a) < k for all \x— 1, ..,p, 
where k is the tolerance bound. We assume that the vec- 
tors £j are randomly distributed, with ((£,j)) = 0, and 

\^,iijj) = ^ij^nv Hence, they introduce symmetric con- 
straints to the solution space, (b) The normalization 
condition J2 3 Jj = N. (c) | J2 3 a jJj/vN\ — 1; however, 
this condition is easily satisfied: if there exists a solution 
satisfying (a) and (b) but yields | ^ a,jJj/VN\ different 
from 1, it is possible to make an adjustment of each com- 
ponent Jj proportional to sgnaj/\/N. Since the noise 
components £jr are uncorrelated with aj, the local fields 
make a corresponding adjustment of the order 
which vanishes in the large N limit. The space of the 
vectors Jj satisfying the constraints (a) and (b) is re- 
ferred to as the version space. 

This formulation of the problem is very similar to that 
of pattern storage in the perceptron with continuous cou- 
plings However, in the perceptron the constraints 
(a) are h) 1 > k, while there is an extra inversion sym- 
metry in the NR model: the version space is invariant 
under J — * — J. We can also consider the NR model as a 
simplified version of the perceptron with multi-state out- 
put, in which the values of local fields for each pattern 
are bounded in one of the few possible intervals. In the 
present model, all local fields are bounded in the sym- 
metric interval [—&,&]. This symmetry will lead to very 
different phase behavior, although it shares the common 
feature that the version space is not connected or not 
convex, with other perceptron models, e.g. errors were 
allowed j|, couplings were discrete || or pruned [fio|| , 
transfer functions were non-monotonic 

When the number of noise sources increases, the ver- 
sion space is reduced and undergoes a sequence of phase 
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transitions, causing it to disappear eventually. These 
transitions are observed by monitoring the evolution of 
the overlap order parameter q, which is the typical over- 
lap between two vectors in the version space. For few 
noise sources, the version space is extended and q = 0. 
When the number of noise sources p increases, the num- 
ber of constraints increases and the version space shrinks. 

One possible scenario is that each constraint reduces 
the volume of the version space, and there is a contin- 
uous transition to a phase of nonzero value of q. Alter- 
natively, each constraint introduces a volume reduction 
resembling a percolation process, in which the version 
space remains extended until a sufficient number of con- 
straints have been introduced, and the version space is 
suddenly reduced to a localized cluster. This may result 
in a discontinuous transition from zero to nonzero q. We 
expect that the transition takes place when p is of the 
order N, and we define a = p/N as the noise population. 
When a increases further, q reaches its maximum value 
of 1 at a = a c , which is called the critical population. 
The purpose of this paper is to study the nature and 
conditions of occurrence of these transitions. 

We consider the entropy 5, which is the logarithm of 
the volume of the version space and is self-averaging. Us- 
ing the replica method, S = lim„^o(((V")) — l)/n, and we 
have to calculate (V")) given by 



n „ N N p 

n/n^(E^ 2 -^)n^ 2 

„ — 1 " A — 1 A — 1 , 1 
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with h% = J2j JjCj/vN. Averaging over the input pat- 
terns, and using the Gardner method Q, we can rewrite 
(§ as (V"» = IUa<b=i d 1abexp(Ng). The overlaps be- 
tween the coupling vectors of distinct replicas a and 6: 
q a b = ^2f = iJjJj/N, are determined from the stationar- 
ity conditions of g. 

Due to the inversion symmetry of the constraints, it 
always has the all-zero solution (q a f, = 0,Va < 6), but it 
becomes locally unstable at a noise population 
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2 fc 2 exp(-fc 2 ) 
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For a > a at, the simplest solution assumes q a t — Q > 0. 
This replica symmetric solution (RS), however, is not 
stable against replica symmetry breaking (RSB) fluctu- 
ations for any q > 0. Hence, (^) is an Almeida- Thouless 
line [0 , and RSB solutions in the Parisi scheme |Q] have 
to be considered. 

The transition of q from zero to nonzero is absent in 
the problem of pattern storage in the perceptron, where 
q increases smoothly from zero when the storage level a 
increases 0. Rather, the situation is reminiscent of the 
spin glass transition in the Sherrington-Kirkpatrick (SK) 
model, which does possess an inversion symmetry 



The phase diagram is discussed in the following 3 
schemes: 

1. RS solution (RS, superscript ): It provides a good 
approximation of the full picture, which will be described 
later. The RS solution is given by q — qvA, where c/ea is 
the Edwards- Anderson order parameter . Close to the 
AT line, q where t=(a — q?at) / Q^at ^ 1 ■ The critical 
population, obtained in the limit <?—*!, is 



fc 



o^(fc) = l(l + fc 2 )(l-erf( — )) 



l fceX p ( _*!)j. (3) 



Two features are noted in the phase diagram in Fig. 1: 

a) The critical population line crosses the AT line: 
When fc = 0, the version space is equivalent to the solu- 
tion of p homogeneous iV-dimensional linear equations. 
Hence it vanishes at p — N, or = 1- The small k 
expansion of ^ and (||) gives 

a AT (fc) ~ l + 2fc 2 /3 < c4 0) (fc) ~ l + 4fc/V2^ , (4) 

and q increases smoothly from at a at to 1 at . How- 
ever, in the large k limit, the critical population grows 
exponentially with the square of the tolerance fc, and the 
reverse is true: 



2 Y exp( y ). (5) 



b) The first order transition line takes over the AT line: 
The paradox in a) is resolved by noting that for a given 
fc, multiple RS solutions of q may coexist for a given a. 
Indeed, a^(k,q) is monotonic in q for fc < fcc°' = 2.89, 
and not so otherwise. Among the coexisting stable RS 
solutions for fc > k^ , the one with the lowest entropy is 
relevant. Hence, there is a first order transition in q at a 
value a^\k), determined by the vanishing of the entropy 
difference between the coexisting stable solutions. The 
jump in q across the transition widens from at fc = kc , 
and when fc increases above k^ = 3.11, q jumps directly 
from zero to nonzero at the transition point. The line of 
(fc) starts from fc = k^ '; it crosses the line of q;at(^) 
at ko*\ replacing it to become the physically observable 
phase transition line. In the limit of large fc, the first 
order transition line is given by (fc) = 0.94a^\k), 
where q jumps from to a high value of 1 — 0.27fc~ 2 . 

The phase diagram illustrates the nature of the phase 
transitions. For low tolerance fc, each constraint results 
in a significant reduction in the version space, and there 
is a continuous transition to a phase of nonzero value 
of q. For high tolerance fc, each constraint introduces 
a volume reduction which is less significant, resembling 
a percolation process, in which the transition of q from 
zero to nonzero is discontinuous. However, even in the 
large fc limit, the region of high q spans a nonvanishing 
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range of a below the critical population, and the picture 
of percolation transition has to be refined in the next 
approximation. 



qo becomes zero. The line starts from k 
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1. Comparison of a c °\ a4 and qat- Inset: the pie- 



ce ^ and &q , 
II) q > 0. Shaded area: multiple solutions of q. 



ture within RS near k c and k^ , with 2 areas: I) q 
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2. First step RSB approximation (TtSBi, superscript 
^ ): Here the n replicas are organized into clusters, each 
with to replicas. q a b — qi for replicas in the same cluster, 
and q a b = Qo otherwise, and in the limit n — » 0, 0<m<l 
for analytic continuation. 

Two (qi, qo, to) solutions exist just above the AT line: 



(<7EA, 3<?EA, fn {oa >), (<?EA,0 



where (see later) 
m {00) is the position of the turning point in the Parisi 
function of the full RSB solution. Only the (q > re- 
solution is stable with respect to fluctuations of q . 
The features in the phase diagram in Fig. 2 are: 

a) First order transition: For k > — 2.31 mul- 
tiple solutions exist, and there is a first order transi- 
tion of q at a^\k). This line starts from k = kc , 
and crosses the line of q;at(&) at kjp = 2.61, to be- 
come the physically observable phase transition line. For 
large k, q\ = 1 — 1.23/(fcln£;) 2 at the transition, and 

a>i (k) = (1 — 0.33/ lnA;)ac (k). Hence, the gap between 
the lines of (k) and ac(k) vanishes, and the ex- 
tended phase disappears near the critical population via 
a narrow range of localized phase, resembling a percola- 
tion transition. However, the resemblance is not generic, 
since the approach of the two lines is logarithmically slow. 

b ) Reduced critical population: The critical population 
is reduced compared with otc (k). In the large k limit, 



/ tt/2 k lnfc exp(fc 2 /2) 



(6) 



c ) Transition of qo : The regime of nonzero qi and qo 
spans the region of low k below the critical population. 
At larger values of k, however, there is a line ct£) where 



line of critical population, and ends at k = k, 
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on the line of first order transition. Beyond this line in 
the localized phase, only the solution with qo = exists, 
reflecting that constraints with larger k are more likely to 
reduce the size of local clusters, but less likely to change 
the extended distribution of the clusters. 

The formation of localized clusters at the percolation 
transition is illustrated by the evolution of the over- 
lap distribution. For k > k^ at the first order tran- 
sition line, the parameter m decreases smoothly, with 
increasing a, from 1 across the phase transition line. 
This is analogous to the RS-RSBi transition in the ran- 
dom energy model Jl],[l^] and the high temperature per- 
ceptron Since the overlap distribution is P(q) — 

mS(q — qo) + (1 — m)S(q — q\), the transition is continu- 
ous in the overlap distribution, although discontinuous 
in terms of qo- At the transition, the statistical weight of 
the q = qo = component decreases smoothly, while that 
of the q = qi component increases from zero. When two 
points in the version space are sampled, they have a high 
probability to share an overlap qo, meaning that they 
belong to different clusters. Hence the version space con- 
sists of many small clusters. Furthermore, since qo = 0, 
the clusters are isotropically distributed. In contrast, for 
the transition at low k, the overlap has the same form as 
the SK model, i.e. a high probability being q±, implying 
that it consists of few large clusters. 




FIG. 2. The picture within RSBi, with the three areas: I) 
go = qi = 0, II) < go < qi < 1 and III) < gi < 1, go = 0. 
Inset: the region near kc and kg . Shaded area: multiple 
solutions of gi and go- 

3. Infinite step RSB solution (RSB^, superscript ( 00 ) / ): 
This is introduced since the RSBi solution is found to be 
unstable against further breaking fluctuations. Here the 
n replicas are organized into hierarchies of clusters. In 
the ith hierarchy, the clusters are of size TOj, and q a b = qi- 
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In the limit n — > 0, < ■ • • < m, < • ■ ■ < 1 for analytic 
continuation. In the Parisi scheme 111, the overlap q a b is 
represented by the Parisi function q{x), where x is the 
cumulative frequency of q, or P(q) = dx(q)/dq. We have 
only obtained solutions for a just above cxat'- 

q{x) = t <j p min(>/77i (oo) , 1) . (7) 

As shown in Fig. 3, the Parisi function q(x) is very sim- 
ilar to that of the SK model without external field near 
the critical temperature Q. 

For the behavior of the RSBoo solution far away from 
the AT line, we only give some qualitative features of the 
phase behavior. Below the AT line we have the extended 
phase characterized by q(x) = 0. For small k the AT 
line lies under the ai°^ (k) line, because of (f|) and af - 
a (°°> ~0(fc 2 ). Hence for a increasing above «at, there is 
a continuous transition of q(x) given by (Q), and ijea — > 1 
for a->al°°'(fc). 

For large k the AT line runs below the (fc) line, be- 
cause ai 00 " 1 (k)< oip (k)< af (fc) . Hence the ot£°^ (k) line 
must intersect the AT line, and there must exist a critical 
fcc 00 " 1 above which there is a first order transition from a 
solution with low (or zero) <3ea to one with high ^ea (de- 
noted by a^ 00 ' (k)). This line intersects the «at(^) line at 
a value kg 00 ^ , replacing it to be the physically observable 
phase transition line. The critical population ai°°\k) is 
obtained by taking the limit </ea — ► 1- 

We expect that the picture of version space percolation 
for large k continues to be valid in the RSBoo ansatz. 
This means that the oil (k) line will become arbitrar- 
ily close to the ai°°\k) line. When a increases above 
(k), q(x) will deviate appreciably from the zero func- 
tion only in a narrow range of x near 1 . This means that 
the overlap q is zero with a probability almost equal to 
1, and nonzero with a probability much less than 1. The 
corresponding picture is that the version space consists 
of hierarchies of localized clusters which are scattered in 
all directions of an N dimensional hypersphere. Again, 
the transition is continuous in terms of the overlap dis- 
tribution, though not so in terms of q{x). 




qEA 
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Fig. 3. The RSBoo solution near the AT line (a). For 
comparison, the RSBi solution (b: nonzero qo, c: zero 
qa) and the RS solution (d) are also plotted. 



The phase transitions observed here have implications 
to the NR problem. In the extended phase for low val- 
ues of a, an adaptive algorithm can find a solution easily 
in any direction of the N dimensional parameter space. 
In the localized phase, the solution can only be found 
in certain directions. If the tolerance k is sufficiently 
high, there is a jump in the overlap distribution, which 
means that the search direction is suddenly restricted on 
increasing a. For very high tolerance k, the picture of 
percolation transition applies, so that even though local- 
ized clusters of solutions are present in all directions, it is 
difficult to move continuously from one cluster to another 
without violating the constraints. 

Our results explain the occurrence of RSB in percep- 
trons with multi-state and analog outputs, when there are 
sufficiently many intermediate outputs |IJ,|l5|. In these 
perceptrons, the patterns to be learned, can be separated 
into two classes: p ln t have zero (intermediate) output, 
p cxt = (p— Pint) have positive or negative (extremal) out- 
put. The version space formed by the intermediate states 
has the same geometry as the model studied here, and 
consists of disconnected clusters for sufficiently large p. 
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